F Ramework for W Eb L Og D Ata Using a L Earning a Lgorithm
نویسندگان
چکیده
With the continued growth and proliferation of Web services and Web based information systems, the volumes of user data have reached astronomical proportions. Before analyzing such data using web mining techniques, the web log has to be pre processed, integrated and transformed. As the World Wide Web is continuously and rapidly growing, it is necessary for the web miners to utilize intelligent tools in order to find, extract, filter and evaluate the desired information. The data pre-processing stage is the most important phase for investigation of the web user usage behaviour. To do this one must extract the only human user accesses from weblog data which is critical and complex. The web log is incremental in nature, thus conventional data pre-processing techniques were proved to be not suitable. Hence an extensive learning algorithm is required in order to get the desired information.This paper introduces an extensive research frame work capable of pre processing web log data completely and efficiently. The learning algorithm of proposed research frame work can separates human user and search engine accesses intelligently, with less time. In order to create suitable target data, the further essential tasks of pre-processing Data Cleansing, User Identification, Sessionization and Path Completion are designed collectively. The framework reduces the error rate and improves significant learning performance of the algorithm. The work ensures the goodness of split by using popular measures like Entropy and Gini index. This framework helps to investigate the web user usage behaviour efficiently. The experimental results proving this claim are given in this paper.
منابع مشابه
ANTLIMA - A Listener Model with Mental Images
Star t ing f r om the thesis tha t the audience expects the speaker to mean the most typ ical case of the described class of events or si tuat ions w i t h respect to the communicated contex t , we explain a mechanism for representing and using typ ica l i t y d is t r ibut ions of static spat ia l relat ions which is related to Herskovits ' analyt ica l f ramework. Extended to restr ict ions o...
متن کاملبررسی تنوع ژنتیکی و گروهبندی ژنوتیپهای جو(Hordeum vulgare L.) از لحاظ مقاومت به بیماری سفیدک پودری (Powdery Mildew) در مرحلۀ گیاهچهای
این پژوهش بهمنظور ارزیابی تنوع 70 ژنوتیپ جو از لحاظ مقاومت به بیماری سفیدک پودری (powdery mildew) انجام گرفت. پس از تهیۀ نمونههای آلوده به بیماری، تکثیر اسپورهای قارچ بر روی رقم حساس افضل صورت گرفت. آزمایش در قالب طرح بلوکهای کامل تصادفی بهصورت گلدانی در گلخانه انجام گرفت. گیاهچهها در مرحلۀ دوبرگی با اسپورهای قارچ مایهزنی شدند. دوازده روز بعد صفات تیپ آلودگی و درصد آلودگی برگها براساس مق...
متن کاملOn Hypermomentum in General Relativity III. Coupling Hypermomentum to Geometry
I n Par t 1 1 of th is series we presented the not ion of the mate r ia l hypermomentum cur ren t and mot ivated i ts i n t r o d u c t i o n in to genera l re la t i v i t y . I n Par t I I 2 we showed tha t a general, l i near l y connected m a n i f o l d w i t h symmet r i c me t r i c ( L 4 , g) is the appropr ia te geometr ica l f ramework fo r such an i n t roduc t i on . T h e present p...
متن کاملRules for Pronominalization
R igorous i n te rp re ta t i on of p ronouns is possib le when syn tax , semant ics, and pragmat ics of a d iscourse can be reasonably con t ro l l ed . In te rac t ion w i t h a database prov ides such an env i ronmen t . In the f ramework of the User Spec ia l ty Languages system and Discourse Representat ion T h e o r y , we fo rmu la te s t r i c t and p re fe ren t ia l rules fo r p ronom...
متن کاملSiesmic Assessment og Ductility and Strength Capacities of Low-Rise R. C. Buildings
This paper presents a methodology for the assessment of ductility and strength capacities in low-rise buildings. This method utilizes the characteristics of force-displacement for the lowest story level or considers the weakest story in any given low-rise building for its primary analysis. Calculations are based on two levels of earthquake motions, namely strong earthquakes (PGA=0.3 g), and ver...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011